Picture for Yuqing Yang

Yuqing Yang

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Add code
May 05, 2025
Viaarxiv icon

Empowering Agentic Video Analytics Systems with Video Language Models

Add code
May 02, 2025
Viaarxiv icon

Zoomer: Adaptive Image Focus Optimization for Black-box MLLM

Add code
Apr 30, 2025
Viaarxiv icon

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Add code
Apr 22, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Image Super-Resolution ($\times$4): Methods and Results

Add code
Apr 20, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

Add code
Apr 14, 2025
Viaarxiv icon

Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash

Add code
Apr 11, 2025
Viaarxiv icon

Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment

Add code
Mar 21, 2025
Viaarxiv icon

HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models

Add code
Mar 14, 2025
Viaarxiv icon

Towards Explainable Doctor Recommendation with Large Language Models

Add code
Mar 04, 2025
Viaarxiv icon